The State of Machine Translation in Europe

نویسنده

  • John Hutchins
چکیده

This first half of this general survey covers MT and translation tools in use, including translators workstations, software localisation, and recent commercial and in-house MT systems. The second half covers the research scene, multilingual projects supported by the European Union, networking and evaluation. In comparison with the United States and elsewhere, the distinctive features of activity in Europe in the field of machine translation and machine-aided translation are: (i) the development and popularity of translator workstations, (ii) the strong software localisation industry, (iii) the vigorous activity in the area of lexical resources and terminology, (iv) and the broad based research on language engineering supported primarily by European Union funds. 1. Translator's workstations Recent years have seen great advances in the development and exploitation of support tools for translator. The four most widely used translator's workstations originate from Europe: Trados' Translation Workbench, IBM's TranslationManager, Eurolang's Optimizer, and STAR's Transit. In addition, Europe has been the centre for most of the background and current research on workstations: the TWB project funded by the European Union, and the Commission's own EURAMIS project. For professional translators, the attraction of the workstation is the integration of tools from simple word processing aids (spelling and grammar checkers) to full automatic translation. The translator can choose to make use of whichever tool seems most appropriate for the task in hand. The vendors of these systems always stress that translators do not have to change their work patterns; the systems aim to increase productivity with translator-oriented tools which are easy to use and fully compatible with existing word processing systems. In facilities and functions, each offer similar ranges: multilingual split-screen word processing, terminology recognition, retrieval and management, translation memory (pre-translation based on existing texts), alignment software for users to create their own bilingual text databases, retention of original text formatting, and support for very wide range of European languages, both as source and target languages. Integration to MT systems is now provided by three of the workstations. In the case of Trados access is provided to the Transcend software from Intergraph; IBM Translation Manager links up with Systran; and Eurolang Optimzer with Logos. After its disappointing experience with Eurotra, the European Commission has devoted most of its research support to the development of practical tools for translators and to the creation of essential lexical resources. Most of these projects will be described later. A major project was the TWB (Translator's Work Bench) project -within the ESPRIT framework -which began in 1989 and ended in 1994. The project, led by Triumph-Adler and involving 10 members from companies and universities, investigated the requirements of translators and proposed most of the features which are now commonplace in translator workstations: multilingual editor, document converters, access to lexica and terminology databases (e.g. Eurodicautom), access to MT systems (in this case, METAL), tools for term bank building, pre-translation and translation memory, and in particular a tool kit (System Quirk) for the analysis of texts and the development of lexical resource databases (thesauri, knowledge bases) from corpora and term banks. (A full description of the TWB project has recently been published: Kugler et al. 1995). A second project in this area has been TRANSLEARN, an LRE project for an interactive corpusbased translation drafting tool (a prototype translation memory system) based on EU regulations and directives from the CELEX (European Union law) database. The languages involved have been English, French, Portuguese and Greek. A third project, this time funded by Volkswagen, is investigating the use of a domain knowledge base integrated with a linguistic database as a translation tool; the languages are German and Bulgarian. The Translation Service of the Commission itself is now developing its own workstation: EURAMIS. The aim is to optimize the efficiency of the translation resources already available (e.g. the termbank Eurodicautom), to create a database of translated EU documents (as a 'translation memory'), and to provide easy access to MT systems. It will allow individual translators to develop their own tailor-made resources and facilities, with tools for text corpus management, glossary construction, and text alignment. A particular emphasis will be on the integration of MT and translation tools, including the mutual enrichment of Systran dictionaries and Eurodicautom lexical databases. The European strength in the area of terminology continues. A recent list of termbanks from InfoTerm in Vienna reveals that over two thirds of the 100 terminological databases recorded are based in Europe. The links between terminologists and translators have been a marked feature on the European scene -links which are now extending to the MT community. Terminology and the lexicon was the topic of an EAMT workshop in 1993 (Steffens 1995), and this August another EAMT workshop was held in conjunction with the international terminology conference in Vienna.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparative Study of English-Persian Translation of Neural Google Translation

Many studies abroad have focused on neural machine translation and almost all concluded that this method was much closer to humanistic translation than machine translation. Therefore, this paper aimed at investigating whether neural machine translation was more acceptable in English-Persian translation in comparison with machine translation. Hence, two types of text were chosen to be translated...

متن کامل

The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language

Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...

متن کامل

A new model for persian multi-part words edition based on statistical machine translation

Multi-part words in English language are hyphenated and hyphen is used to separate different parts. Persian language consists of multi-part words as well. Based on Persian morphology, half-space character is needed to separate parts of multi-part words where in many cases people incorrectly use space character instead of half-space character. This common incorrectly use of space leads to some s...

متن کامل

A Hybrid Machine Translation System Based on a Monotone Decoder

In this paper, a hybrid Machine Translation (MT) system is proposed by combining the result of a rule-based machine translation (RBMT) system with a statistical approach. The RBMT uses a set of linguistic rules for translation, which leads to better translation results in terms of word ordering and syntactic structure. On the other hand, SMT works better in lexical choice. Therefore, in our sys...

متن کامل

The state of machine translation in Europe and future prospects

The aim of using computers for translation is not to emulate or rival human translation but to produce rough translations which can serve as drafts for published translations, as gists for information gathering, and as cross-language communication aids. The field of machine translation (MT) covers the usage, research and development of computer aids and systems ranging from production systems f...

متن کامل

بهبود و توسعه یک سیستم مترجم‌یار انگلیسی به فارسی

In recent years, significant improvements have been achieved in statistical machine translation (SMT), but still even the best machine translation technology is far from replacing or even competing with human translators. Another way to increase the productivity of the translation process is computer-assisted translation (CAT) system. In a CAT system, the human translator begins to type the tra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003